Millau: an encoding format for efficient representation and exchange of XML over the Web
Abstract
XML is poised to take the World Wide Web to the next level of innovation. XML data, large or small, with or without an associated schema, will be exchanged between an increasing number of applications running on diverse devices. Efficient storage and transport of such data are therefore important issues. We have designed a system called Millau for efficient encoding and streaming of XML structures. In this paper we describe the Millau algorithms for compressing XML structure and data. In addition to separating structure from text for compression, the Millau algorithms take advantage of the associated schema (if available) when compressing the structure. Millau also defines a programming model, corresponding to the XML DOM and SAX APIs, for Millau streams of XML documents. Our experiments have shown significant performance gains for our algorithms and APIs, and we describe some of these results in this paper. We also describe applications based on Millau, including XML-based remote procedure calls and client-server applications, that take advantage of the compression and streaming technology defined by the system.
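The idea of separating structure from text can be illustrated with a minimal sketch. This is not the Millau format: the token values, the function names `encode` and `decode`, the use of zlib for the text stream, and the NUL separator are all assumptions made for illustration. The sketch ignores attributes, drops whitespace-only text, makes no use of a schema, and handles at most 254 distinct tag names.

```python
import zlib
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Illustrative token values (assumptions, not Millau's actual codes).
END, TEXT = 0, 1      # reserved structure tokens
FIRST_TAG = 2         # tag tokens start at this value


def encode(xml_string):
    """Split a document into a tag table, a structure token stream,
    and a zlib-compressed text stream."""
    root = ET.fromstring(xml_string)
    tags = []                 # position + FIRST_TAG is the token for a tag name
    structure = bytearray()   # one byte per token, so at most 254 tag names
    texts = []

    def visit(elem):
        if elem.tag not in tags:
            tags.append(elem.tag)
        structure.append(tags.index(elem.tag) + FIRST_TAG)
        if elem.text and elem.text.strip():
            structure.append(TEXT)
            texts.append(elem.text)
        for child in elem:
            visit(child)
            # Text following a child element belongs to the parent.
            if child.tail and child.tail.strip():
                structure.append(TEXT)
                texts.append(child.tail)
        structure.append(END)

    visit(root)
    # NUL cannot occur in XML character data, so it is a safe separator.
    text_blob = zlib.compress("\x00".join(texts).encode("utf-8"))
    return tags, bytes(structure), text_blob


def decode(tags, structure, text_blob):
    """Rebuild the markup from the three parts produced by encode()."""
    texts = iter(zlib.decompress(text_blob).decode("utf-8").split("\x00"))
    out, open_tags = [], []
    for token in structure:
        if token == END:
            out.append(f"</{open_tags.pop()}>")
        elif token == TEXT:
            out.append(escape(next(texts)))
        else:
            name = tags[token - FIRST_TAG]
            open_tags.append(name)
            out.append(f"<{name}>")
    return "".join(out)


doc = "<order><item>pen</item><item>ink</item><note>rush</note></order>"
tags, structure, text_blob = encode(doc)
restored = decode(tags, structure, text_blob)
```

In this sketch each repeated tag name costs one byte in the structure stream, while the text is gathered into a single stream and compressed separately. A schema-aware encoder, as the abstract describes, could go further by exploiting what the schema says about allowed structure; that part is not shown here.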
Related resources
Automatic Generation of OWL Ontology from XML Data Source
The eXtensible Markup Language (XML) can be used as data exchange format in different domains. It allows different parties to exchange data by providing common understanding of the basic concepts in the domain. XML covers the syntactic level, but lacks support for reasoning. Ontology can provide a semantic representation of domain knowledge which supports efficient reasoning and expressive powe...
XPACK: A High-Performance WEB Document Encoding
XML is an increasingly popular data storage and exchange format whose popularity can be attributed to its self-describing syntax, acceptance as a data transmission and archival standard, strong internationalization support, and a plethora of supporting tools and technologies. However, XML’s verbose, repetitive, text-oriented document specification syntax is a liability for many emerging applica...
A Complete Search Engine for Efficient and Data Integration Using Fuzzy Search
As the next generation of the Web language, XML is straightforwardly usable, which has been the de-facto standard of information representation and exchange over the Web. XML employs a tree-structured data model, and XML queries specify patterns of selection predicates on multiple elements related by a tree structure. Due to increase in web-based applications, searching for all occurrences of a...
XML Schema in XML Documents with Usage Control
With an increasing amount of semi-structured data, XML has become significant to humans and programs. XML promoted by the World Wide Web Consortium (W3C) is rapidly emerging as a new standard language for semi-structured data representation and exchange on the Internet. XML documents usually contain private information that cannot be shared by all user communities. So securing XML data is becom...
Information Retrieval Systems in XML Based Database – A review
XML the eXtensible Markup Language has emerged as a new standard for data representation and exchange over the Internet. It will become a universal format for data exchange on the Web and that in the near future we will find vast amounts of documents in XML format on the Web. As a result, it has become crucial to sort large collections of XML documents and retrieve relevant information from the...
Journal:
- Computer Networks
Volume 33 Issue
Pages -
Publication date: 2000